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Abstract 

In this paper we propose and analyze a distributed algorithm for achieving globally optimal 
decisions, either estimation or detection, through a self-synchronization mechanism among linearly 
coupled integrators initialized with local measurements. We model the interaction among the nodes 
as a directed graph with weights (possibly) dependent on the radio channels and we pose special 
attention to the effect of the propagation delay occurring in the exchange of data among sensors, as 
a function of the network geometry. We derive necessary and sufficient conditions for the proposed 
system to reach a consensus on globally optimal decision statistics. One of the major results proved 
in this work is that a consensus is reached with exponential convergence speed for any bounded delay 
condition if and only if the directed graph is quasi-strongly connected. We provide a closed form 
expression for the global consensus, showing that the effect of delays is, in general, the introduction 
of a bias in the final decision. Finally, we exploit our closed form expression to devise a double- 
step consensus mechanism able to provide an unbiased estimate with minimum extra complexity, 
without the need to know or estimate the channel parameters. 



1 Introduction and Motivations 



Endowing a sensor network with self-organizing capabilities is undoubtedly a useful goal to increase the 
resilience of the network against node failures (or simply switches to sleep mode) and avoid potentially 
dangerous congestion conditions around the sink nodes. Decentralizing decisions decreases also the 
vulnerability of the network against damages to the sink or control nodes. Distributed computation 
over a network and its application to statistical consensus theory has a long history (see, e.g., [1, 2j), 
including the pioneering work of Tsitsiklis, Bertsekas and Athans on asynchronous agreement problem 
for discrete-time distributed decision-making systems [3] and parallel computing [H [S]. A sensor 
network may be seen indeed as a sort of distributed computer that has to evaluate a function of the 
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measurements gathered by each sensor, possibly without the need of a fusion center. This problem may 
be addressed by taking into account the vast literature on distributed consensus/agreement algorithms. 
These techniques have received great attention in the recent years within the literature on cooperative 
control and multiagent systems [6] — [22]. In particular, the conditions for achieving a consensus over 
a common specified value, like a linear combination of the observations, was solved for networked 
continuous-time dynamic systems by Olfati-Saber and Murray, under a variety of network topologies, 
also allowing for topology variations during the time necessary to achieve consensus [318]. The discrete- 
time case was addressed by Tsitsiklis in [3] (see also [5] ) . Many recent works focused on the distributed 
computation of more general functions than the average of the initial measurements. These include 
average-max-min consensus [12], geometric mean consensus [I3j, and power mean consensus [14j . A 
study on the class of smooth functions that can be computed by distributed consensus algorithms was 
recently addressed in [15] . Application of consensus algorithms to data fusion problem and distributed 
filtering was proposed in \16\ I17j. The study of the so called alignment problem, where all the agents 
eventually reach an agreement, but without specifying how the final value is related to the initial 
measurements, was carried out in [18j-|22j. Two recent excellent tutorials on distributed consensus 
and agreement techniques are [23] and |24j . 

Consensus may be also seen as a form of self-synchronization among coupled dynamical systems. 
In [25\ [26] , it was shown how to use the self-synchronization capabilities of a set of nonlinearly coupled 
dynamical systems to reach the globally optimal Best Linear Unbiased Estimator (BLUE), assuming 
symmetric communication links. As well known, the BLUE estimator coincides with the Maximum 
Likelihood (ML) estimate for the linear observation model with additive Gaussian noise. In particular, 
it was shown in [26] that reaching a consensus on the state derivative, rather than on the state itself (as 
in [7] — [23]), allows for better resilience against coupling noise. In [27], the authors provided a discrete- 
time decentralized consensus algorithm to compute the BLUE, based on linear coupling among the 
nodes, as a result of a distributed optimization incorporating the coupling noise. 

The consensus protocols proposed in [7], [9] — [21] and [Mj — [27] assume that the interactions among 
the nodes occur instantaneously, i.e., without any propagation delay. However, this assumption is not 
valid for large scale networks, where the distances among the nodes are large enough to introduce a 
nonnegligible communication delay. There are only a few recent works that study the consensus or 
the agreement problem, in the presence of propagation delays, namely |S], [2S] — [32]) [31] focusing on 
continuous-time systems, and OS], [3 Ch. 7.3], dealing with discrete-time protocols. More 
specifically, in [HI [28] the authors provide sufficient conditions for the convergence of a linear average 
consensus protocol, in the case of time- invariant homogeneous delays (i.e., equal delay for all the links) 
and assuming symmetric communication links. The most appealing feature of the dynamical system 
in [S] is the convergence of all the state variables to a known, delay-independent, function (equal to 
the average) of the observations. However, this desired property is paid in terms of convergence, since, 
in the presence of homogeneous delays, the system of [8] is able to reach a consensus if and only if 
the (common) delay is smaller than a given, topology-dependent, value. Moreover, the assumption 
of homogeneous delays and symmetric links is not appropriate for modeling the propagation delay in 
a typical wireless network, where the delays are proportional to the distance among the nodes and 
communication channels are typically asymmetric. 
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The protocol of [8j was generalized in [29] to time-invariant inhomogeneous delays (but symmetric 
channels) and in [30j to asymmetric channels. The dynamical systems studied in |291I30| are guaranteed 
to reach an agreement, for any given set of finite propagation delays, provided that the network 
is strongly connected. Similar results, under weaker (sufficient) conditions on the (possibly time- 
varying) network topology, were obtained in Oil], O Ch.7.3] and [22] for the convergence of discrete- 
time asynchronous agreement algorithms. However, since the final agreement value is not a known 
function of the local measurements, agreement algorithms proposed in the cited papers are mostly 
appropriate for the alignment of mobile agents, but they cannot be immediately used to distributively 
compute prescribed functions of the sensors' measurements, like decision tests or global parameter 
estimates. In [47], the authors studied the convergence properties of the agreement algorithms proposed 
in [29] — [31] and provided a closed form expression of the achievable consensus in the case of strongly 
connected balanced digraphs, under the assumption that the initial conditions are nonnegative and the 
state trajectories remain in the nonnegative orthant of the state space. However, the final agreement 
value depends on delays, network topology and initial conditions of each node, so that the bias is 
practically unavoidable. In summary, in the presence of propagation delays, classical distributed 
protocols reaching consensus/agreement on the state [6] — [21], [27] — [31], [17] cannot be used to achieve 
prescribed functions of the sensors' measurements that are not biased by the channel parameters. 

Ideally, we would like to have a totally decentralized system able to reach a global consensus on 
a final value which is a known, delay-independent, function of the local measurements (as in [8j), for 
any given set of inhomogeneous propagation delays (as in [29]) and asymmetric channels (as in [30]). 
In this paper, we propose a distributed dynamical system having all the above desired features. More 
specifically, we consider a set of linearly coupled first-order dynamical systems and we fully characterize 
its convergence properties, for a network with arbitrary topology (not necessarily strongly connected, 
as opposed to [29] — [31]) and (possibly) asymmetric communication channels. In particular, we consider 
an interaction model among the sensors that is directly related to the physical channel parameters. 
The network is modeled as a weighted directed graph, whose weights are directly related to the (fiat- 
fading) channel coefficients between the nodes and to the transmit power. Furthermore, the geometry- 
dependent propagation delays between each pair of nodes, as well as possible time offsets among the 
nodes, are properly taken into account. The most appealing feature of the proposed system is that a 
consensus on a globally optimal decision statistic is achieved, for any (bounded) set of inhomogeneous 
delays and for any set of (asymmetric) communication channels, with the only requirement that the 
network be quasi-strongly connected and the channel coefficients be nonnegative^ 

In particular, our main contributions are the following: i) We provide the necessary and sufficient 
conditions ensuring local or global convergence, for any set of finite propagation delays and network 
topology; ii) We prove that the convergence is exponential, with convergence rate depending, in general, 
on the channel parameters and propagation delays; iii) We derive a closed form expression for the final 
consensus, as a function of the attenuation coefficients and propagation delays of each link; iv) We 
show how to get a final, unbiased function of the sensor's measurements, which coincides with the 
globally optimal decision statistics that it would have been computed by a fusion center having error- 

^This last requirement, if not immediately satisfied, requires some form of phase compensation at the receiver. 
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free access to all the nodes. The paper is organized as follows. Section [2] describes the proposed 
first-order linearly coupled dynamical system and shows how to design the system's parameters and 
the local processing so that the state derivative of each node converges, asymptotically, to the globally 
optimal decision statistics. Section [3] contains the main results of the paper, namely the necessary and 
sufficient conditions ensuring the global or local convergence of the proposed dynamical system, in the 
presence of inhomogeneous propagation delays and asymmetric channels. Finally, Section d] contains 
numerical results validating our theoretical findings and draws some conclusions. 



2 Reaching Globally Optimal Decisions Through Self-Synchronization 

It was recently shown in [33] that, in many applications, an efficient sensor network design should 
incorporate some sort of in-network processing. In this paper, we show first a class of functions that 
can be computed with a totally distributed approach. Then, we illustrate the distributed mechanism 
able to achieve the globally optimal decision tests. 

2.1 Forms of Consensus Achievable With a Decentrahzed Approach 

If we denote hy yi, i = 1, N , the (scalar) measurement taken from node i, in a network composed 
of nodes, we have shown in j25l I26j that it is possible to compute any function of the collected data 
expressible in the form 

2^ Ci9i{yi) \ 

^^Tv ' (1) 

where {q} are positive coefficients and gi{-), i = 1, . . . ,N, and h{-) are arbitrary (possibly nonlinear) 
real functions on M, i.e., ^j, /i : M i— > M, in a totally decentralized way, i.e., without the need of a sink 
node. In the vector observation case, the function may be generalized to the vector form 



f(yi,y2,---,yjv) = h|^(^^Q^ (j2c,g,{y^)^^ , (2) 

where = {yi,k}k=i the vector containing the observations {yi,k}k=i taken be sensor i, and srj(-) 
and h(-) are arbitrary (possibly nonlinear) real functions on M^, i.e., g^, h : i— > R, and {Cj} are 
arbitrary square positive definite matrices. 

Even though the class of functions expressible in the form ([TI) or ([2|) is not the most general one, 
nevertheless it includes many cases of practical interest, as shown in the following examples. 

Example 1: ML or BLUE estimate. Let us consider the case where each sensor observes a vector 
in the form 

yi = A,^ + Wi, i = l,...,N, (3) 

where is the M x 1 observation vector, ^ is the L x 1 unknown common parameter vector, Aj is 
the M X L mixing matrix of sensor i, and Wj is the observation noise vector, modeled as a circularly 
symmetric Gaussian vector with zero mean and covariance matrix Rj. We assume that the noise 
vectors affecting different sensors are statistically independent of each other, and that each matrix 
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Ai is full column rank, which implies M > L. As well known, in this case the globally optimal ML 
estimate of 4 is [S] '■ 

^ML = f(yi,y2,---,y7v) = I >>rRr'A, 1 I >>/RrVi 1 . (4) 




This expression is a special case of ([2]), with Cj = AjJH- ^Aj and Qiiyi) = (A^R- ^Ai) ^Ajli- ^yj. 
If the noise pdf is unknown, ^ still represents a meaningful estimator, as it is the BLUE 



Example 2: Detection of a Gaussian process with unknown variance embedded in Gaus- 
sian noise with known variance. Let us consider now a detection problem. Let yi[k] denote the 
signal observed by sensor i, at time k. The detection problem can be cast as a binary hypothesis test, 
where the two hypotheses are 

Ho: yi[k]=Wi[k], ■ ^ at i, t fr,\ 

^ r,i TM r,i i = l,---,N, k = l,...,K, (5) 

rti : yi[k\ = Si[k\ + Wi[k\, 

where Si[k] is the useful signal and Wi[k] is the additive noise. Let us consider the case where the 
random sequences Si[k] and Wi[k] are spatially uncorrelated and modeled as zero mean independent 
Gaussian random processes. A meaningful model consists in assuming that the noise variance is known, 
let us say equal to o"^, whereas the useful signal variance is not. Under these assumptions, the optimal 
detector consists in computing the generalized likelihood ratio test (GLRT) [34^ [35] 

T(y) = /(y„y2, • • • ,y.) = ^ E E^f W f ^ - ^] - Elog (^] , (6) 

and comparing it with a threshold that depends on the desired false alarm rate. In ([UD, the term Pj 
denotes the ML estimate of the signal power at node i, given by 



\ k=l / 

where (x)^ = max(0, x) . Also in this case, it is easy to check that ([6]) is a special case of ([1]), with 
c, = 1 and 9,{y,) = j, Eti vfik] " ^) " log [35]. 

These are only two examples, but the expressions ([I|) or ([2]) can be used to compute or approximate 
more general functions of the collected data, like, maximum and minimum [12], geometric mean |13j . 
power mean |14j and so on, through appropriate choice of the parameters involved. 

2.2 How to Achieve the Consensus in a Decentrahzed Way 

The next question is how to achieve the aforementioned optimal decision statistics with a network 
having no fusion center. In |25[ [26] we proposed an approach to solve this problem using a nonlinear 
interaction model among the nodes, based on an undirected graph and with no propagation delays in 
the exchange of information among the sensors. In this paper, we consider a linear interaction model, 
but we generalize the approach to a network where the propagation delays are taken into account and 
the network is described by a weighted directed graph (or digraph, for short), which is a fairly general 
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model to capture the non reciprocity of the communication hnks governing the interaction among the 
nodes. 

The proposed sensor network is composed of nodes, each equipped with four basic components: 
i) a transducer that senses the physical parameter of interest (e.g., temperature, concentration of 
contaminants, radiation, etc.); ii) a local processing unit that processes the measurement taken by the 
node; iii) a dynamical system, whose state is initialized with the local measurements and it evolves 
interactively with the states of nearby sensors; iv) a radio interface that transmits the state of the 
dynamical system and receives the state transmitted by the other nodes, thus ensuring the interaction 
among the nodej^. 

Scalar observations. In the scalar observation case, the dynamical system present in node i evolves 
according to the following linear functional differential equation 

Xiit;y) = giivi) H aij {xj{t - r^j-y) - Xi{t;y)) , t > 0, 

"^S-eM i = l,...,N, (7) 

Xi{t;y) = (pi{t), te [-T, 0], 

where Xi{t; y) is the state function associated to the i-th sensor that depends on the set of measurements 
y = {yi}'h=i'i diiVi) is a function of the local observation y^, whose form depends on the specific decision 
test; X is a positive coefficient measuring the global coupling strength; q is a positive coefficient that 
may be adjusted to achieve the desired consensus; Tij = Tij + dij/c is a delay incorporating the 
propagation delay due to traveling the internode distance dij, at the speed of light c, plus a possible 
time offset T^j between nodes i and j. The sensors are assumed to be fixed so that all the delays 
are constant. We also assume, realistically, that the maximum delay is bounded, with maximum 
value T = maxijTij. The coefficients Oij are assumed to be nonnegative and, in general, dependent 
on transmit powers and channel parameters. For example, Uij may represent the amplitude of the 
signal received from node i and transmitted from node j. In such a case, we have Oij = Pj\hij\^ / d!l-, 
where Pj is the power of the signal transmitted from node j; hij is a fading coefficient describing the 
channel between nodes j and i; r/ is the path loss exponent. The nonnegativity of Oij requires some 
form of channel compensation at the receiver side, like in the maximal ratio receiver, for example, if 
the channel coefficients {/ijjjij are complex variables. Furthermore, we assume, realistically, that node 
i "hears" node j only if the power received from i exceeds a given threshold. In such a case, Oij ^ 0, 
otherwise aij = 0. The set of nodes that sensor i hears is denoted by A/i = {j = 1, . . . , : ajj / 0}. 
Observe that, in general, Oij ^ aji, i.e., the channels are asymmetric. It is worth noticing that the 
state function of, let us say, node i depends, directly, only on the measurement yi taken by the node 
itself and only indirectly on the measurements gathered by the other nodes. In other words, even 
though the state Xj(t;y) gets to depend, eventually, on all the measurements, through the interaction 
with the other nodes, each node needs to know only its own measurement. 

Because of the delays, the state evolution ([7]) for, let us say, t > 0, is uniquely defined provided 

^The state value is transmitted by modulating an appropriate carrier. Because of space limitations, it is not possible 
to go into the details of this aspect in this paper. In parallel works, we have shown that a pulse position modulation of 
ultrawideband signals may be a valid candidate for implementing the interaction mechanism. Some preliminary remarks 
on the implementation of the proposed protocol can be found in [38] . 
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that the initial state variables Xi{t;y) are specified in the interval from — r to 0. The initial conditions 
of ([7]) are assumed to be taken in the set of continuously differentiable and bounded functions 4>i{t) 
mapping the interval [— r, 0] to R (see Appendix [B] for more details). 

Vector observations. If each sensor measures a set of, let us say, L physical parameters , the coupling 
mechanism ([7|) generalizes into the following expressioij^l 

Xi(t;y) = gi(yi) +K Qr^ ^ aij (xj(t - Tij;y) -Xj(i;y)) , t > 0, 

ieM i = l,...,iV, (8) 

x,(t;y) = 0,(t), te[-T, 0], 

where Xj(t) is the L-size vector state of the i-th node; S'j(yi) is a vector function of the local observation 
Vi — {yi,fc}fc=ii ^-^^ 9i • ^ Qi is an L X L non-singular matrix that is a free parameter to 

be chosen according to the specific purpose of the sensor network. 

As in ([7]), the initial conditions of ([8]) are assumed to be arbitrarily taken in the set of continuously 
differentiable and bounded (vectorial) functions </>j(i) mapping the interval [— r, 0] to M^, w.l.o.g.. 

2.3 Self-Synchronization 

Differently from most papers dealing with average consensus problems [6] — [23] and [27] — [31], where 
the global consensus was intended to be the situation where all dynamical systems reach the same 
state value, we adopt here the alternative definition already introduced in our previous work [26]. We 
define the consensus (through network synchronization) with respect to the state derivative, rather 
than to the state, as follows. 

Definition 1 Given the dynamical system in ^ (or (S^), a solution {x*{t;y)} of ^ (or (B^) is said 
to be a synchronized state of the system, if 

±*{t;y)=u*{y), Vi = 1, 2, . . . , iV. (9) 

The system ^ ( or l^j ) is said to globally synchronize if there exists a synchronized state as in (0), 
and all the state derivatives asymptotically converge to this common value, for any given set of initial 
conditions {4>i}, i.e., 

lim ||i,(t;y)-^'^(y)|| =0, V0„ i = 1, 2, . . . , iV, (10) 

tf— ♦oo 

where || • || denotes some vector norm and {xj(t; y)} is a solution of ^ (or (B^). The synchronized state 
is said to be globally asymptotically stable if the system globally synchronizes, in the sense specified in 
il(J\}. The system ^ (or ^) is said to locally synchronize if there exist disjoint subsets of the nodes, 
called clusters, where the nodes in each cluster have state derivatives converging, asymptotically, to the 
same value, for any given set of initial conditions {(pi}. 

Observe that, according to Definition (U if there exists a globally asymptotically stable synchronized 
state, then it must necessarily be unique (in the derivative). In the case of local synchronization instead, 
the system may have multiple synchronized clusters, each of them with a different synchronized state. 

■^We assume that the coupling coefficients aij are the same for all estimated parameters. This assumption is justified 
by the fact that all the parameters are sent through the same physical channel. 
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As it will be shown in the next section, one of the reasons to introduce a novel definition of consensus, 
different from the classical one [6] — [2jj, [27] — [31], is that the convergence on the state derivative, rather 
than on the state, is not affected by the presence of propagation delays and there is a way to make the 
final consensus value to coincide with the globally optimal decision statistic. In the ensuing sections, 
we will provide necessary and sufficient conditions for the system in ([7]) (or dS])) to locally/globally 
synchronize according to Definition [H along with the closed form expression of the synchronized state. 



3 Necessary and Sufficient Conditions for Self-Synchronization 

The problem we address now is to check if, in the presence of propagation delays and asymmetric 
communication links, the systems ([7]) and ^ can still be used to achieve globally optimal decision 
statistics in the form ([T]) and dJ]), in a totally distributed way. To derive our main results, we rely on 
some basic notions of digraph theory. To make the paper self-contained, in Appendix lA.ll we recall 
the basic definitions of weak, quasi-strong and strong connectivity (WC, QSC, and SC, for short) of 
a digraph. In Appendix lA. 21 we recall the algebraic properties of the Laplacian matrix associated to 
a digraph and we derive the properties of its left eigenvectors, as they will play a fundamental role 
in computing the achievable forms of consensus. In Appendix [Bl we provide sufficient conditions for 
the marginal stability of linear delayed differential equations, as they will be instrumental to prove the 
main results of this paper. 

The next theorem provides necessary and sufficient conditions for the proposed decentralized ap- 
proach to achieve the desired consensus in the presence of propagation delays and asymmetric com- 
munication links. 

Theorem 1 Let ^ ={'^, (o} be the digraph associated to the network in with Laplacian matrix L. 
Let 7 = [7;^, . . . ,7^]^ be a left eigenvector o/L corresponding to the zero eigenvalue, i.e., 7^L = 0^. 
Given the system in assume that the following conditions are satisfied: 

al The coupling gain K and the coefficients {cj} are positive; the coefficients {flij} are non-negative; 

a2 The propagation delays {Tj^} are time-invariant and finite, i.e., Tij < r = maxj^j Tj^ < -|-oo, 
Vi / i; 

a3 The initial conditions are taken in the set of continuously differentiable and bounded functions 
mapping the interval [— r, 0] to M^. 

Then, system ^ globally synchronizes for any given set of propagation delays, if and only if the 
digraph §f is Quasi- Strongly Connected (QSC). The synchronized state is given by 

EN 
. .liCigiivi) 

-'■^y) = ^ '^'n ^ ^ (11) 

2^,=i + K ^^^^ ^^.^^^ 7.a.,r,, 

where 7j > i/ and only if node i can reach all the other nodes of the digraph through a strong path, 
or 7j = 0, otherwise. 

The convergence is exponential, with rate arbitrarily close to r = — minj{|Re{sj}| : p{si) = 
and Si ^ 0}, where p{s) is the characteristic function associated to system ^ (cf. Appendix W\) . 
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Proof. See Appendix [O ■ 
Theorem [1] has a very broad apphcabihty, as it does not make any particular reference to the 
network topology. If, conversely, the topology has a specific structure, then we may have the following 
forms of consensus ^ 

Corollary 1 Given system assume that conditions al)-a3) of Theorem{l\ are satisfied. Then, 

1. The system globally synchronizes and the synchronized state is given by 

io*{y)=gr{yr), r e {1,2,...,N}, (12) 

if and only if the digraph ^ contains only one spanning directed tree, with root node given by 
node r. 

2. The system globally synchronizes and the synchronized state is given by with all 7^ 's positive 
if and only if the digraph ^ is Strongly Connected (SC). The synchronized state becomes 

-^(y) = ^^^v ^'^N ^ > (13) 

if and only if, in addition, the digraph 'i§ is balanced. 

3. The system locally synchronizes in K disjoint clusters "^i, . . . , ^ {1, • • • 7 A^jlfl with synchro- 
nized state derivatives for each cluster 

y JiCigi{yi) 

xl(t;yk) = — — , VgG^fc, k = l,...,K, (14) 

withy k = {yi}i£'^k, if and only if the digraph'^ is weakly connected (WCJ and contains a spanning 



directed forest with K Root Strongly Connected Components (RSCC) 



The proof of this corollary is a particular case of Theorem 1, except that we exploit the structure 
of the left eigenvector corresponding to the zero eigenvalue of the Laplacian matrix L, as derived 
in Appendix El 2. The previous results can be extended to the vector case in ([8]), according to the 
followingl^ 

Theorem 2 Given the system ^B^, assume that conditions a2)-a3) of Theorem{l\ are satisfied and that 
the matrices {Qj} are positive definite. Then, the system synchronizes for any given set of propagation 
delays, if and only if the digraph ^ is QSC. The synchronized state is given by 

(N / N N \\^^/^ \ 

1^Q^ + II ® U 5; l^a^JT,J l^Q^g^{y^) , (15) 

i=l \ i=l jeAfi I I \i=l j 



^We focus, w.l.o.g., only on WC digraphs. In the case of non WC digraphs, Corollary [T] applies to each disjoint 
component of the digraph. 

*In general, the clusters "^i, . . . , are not a partition of the set of nodes {1, • ■ ■ , A^}. 
^Please, see Appendix|All for the formal definition of (root) strongly connected component. 

^The proof is omitted because of space limitations, but it follows the same guidelines as the proof for the scalar case, 
since the interaction among the nodes is the same. 
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where (8) denotes the Kronecker product and 'j^ > if and only if node i can reach all the other 
nodes of the digraph by a strong path, or 7^ = 0, otherwise. The convergence is exponential, with 
rate arbitrarily close to r = — minj{|Re{sj}| : p{si) = and Si ^ 0}, where p{s) is the characteristic 
function associated to system 

3.1 Impact of propagation delays on convergence 

The impact of delays in consensus-achieving algorithms has been analyzed in a series of works [U 
Ch.7.3], [8], [22], [28] — [31]. Among these works, it is useful to distinguish between consensus al- 
gorithms, [8l|23], where the states of all the sensors converge to a prescribed function (typically the 
average) of the sensors' initial values, and agreement algorithms, [5, Ch.7.3], [22], [28j — [3T]. |47j . where 
the goal is to make all the states to converge to a common value, but without specifying how this value 
has to be related to the initial values. In our application, we can only rely on consensus algorithms, 
where the final consensus has to coincide with the globally optimal decision statistic. 

The consensus algorithms analyzed in [8l [23] assume the same delay value for all the links, i.e., 
Tij = T, and symmetric channels, i.e., Uij = aji, for all i ^ j. Under these assumptions, the average 
consensus in |8l [23] is reached if and only if the common delay r is upper bounded by [8] 

vr 1 , , 

^ < 2 a;7' 

where A at denotes the maximum eigenvalue of the Laplacian associated to the undirected graph of the 
network (cf. Appendix |A|) . Since, for any graph with nodes, we have |39] 

A^ 

j;^ dma.x < Aat < 2 dmax, (17) 

with diaax denoting the maximum graph degree, increasing dmax imposes a strong constraint on the 
maximum (common) tolerable delay. This implies, for example, that networks with hubs (i.e., nodes 
with very large degrees) that are commonly encountered in scale-free networks |36j . are fragile against 
propagation delays, when using the consensus algorithms of [51 [^. 

In our application, we were motivated to extend the approach of [8l [23] to the general case of 
inhomogeneous delays and asymmetric channel links. Nevertheless, in spite of the less restrictive 
assumptions, Theorem [1] shows that our proposed algorithm is more robust against propagation delays, 
since its convergence capability is not affected by the delays. Moreover, our approach is valid in the 
more general case of asymmetric communications links and the final value is not simply the average 
of the measurements, but a weighted average of functions of the measurements that can be made to 
coincide with the desired globally optimal decision statistic in the form ([T]) or ([2D, through a proper 
choice of the coefficients Cj. 

An intuitive reason for explaining the main advantage of our approach is related to the use of an 
alternative definition of global consensus: As opposed to conventional methods requiring consensus on 
the state value, i.e. [6] — [H], [28] — [31], we require the convergence over the state derivative, i.e., we 
only require that the state trajectories converge towards parallel straight lines. The slope must be the 
same for all the trajectories and it has to coincide with the desired decision statistic. But the constant 
terms of each line may differ from sensor to sensor. This provides additional degrees of freedom that, 
eventually, make our approach more robust against propagation delays or link coefficients. 
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3.2 Effect of network topology on final consensus value 

Theorem [1] generalizes all the previous (only sufficient) conditions known in the literature [8], |28j — 
|31j for the convergence of linear agreement /consensus protocols in the presence of propagation delays, 
since it provides a complete characterization of the synchronization capability of the system for any 
possible degree of connectivity in the network (not only for SC digraphs), as detailed next. 

In general, the digraph modeling the interaction among the nodes may have one of the following 
structures: i) the digraph contains only one spanning directed tree, with a single root node, i.e., there 
exists only one node that can reach all the other nodes in the network through a strong directed path; 
ii) the digraph contains more than one spanning directed tree, i.e., there exist multiple nodes (possibly 
all the nodes), strongly connected to each other, that can reach all the other nodes by a strong directed 
path; iii) the digraph is weakly connected and contains a spanning forest, i.e., there exists no node 
that can reach every other node through a strong directed path; iv) the digraph is not even weakly 
connected. The last case is the least interesting, as it corresponds to a set of isolated subnetworks, 
that can be analyzed independently of each other, as one of the previous cases. In the first two cases, 
according to Theorem [U system ([7]) achieves a global consensus, whereas in the third case the system 
forms clusters of consensus with, in general, different consensus values in each cluster, i.e., the system 
synchronizes only locally. 

In other words, a global consensus is possible if and only if there exists at least one node (the root 
node of the spanning directed tree of the digraph) that can send its information, directly or indirectly, 
to all the other nodes. If no such a node exists, a global consensus cannot be reached. However, a local 
consensus is still achievable among all the nodes that are able to influence each other. 

Interestingly, the closed form expressions of the synchronized state given by (jlip confirms the above 
statements: The observation yj of, let us say, sensor i affects the final consensus value if and only if 
such an information can reach all the other nodes by a strong directed path. As a by-product of this 
result, we have the following special cases (Corollary 1): If there is only one node that can reach all the 
others, then the final consensus depends only on the observation taken from that node; on the other 
extreme, the final consensus contains contributions from all the nodes if and only if the digraph is SC. 

Moreover, if a node contributes to the final value, it does that through a weight that depends on 
its in/out-degree. The set of weights {j^} in (jlip can be interpreted as a measure of the "symme- 
try/asymmetry" of the communication links in the network: Some of these weights are equal to each 
other if and only if the subdigraph associated to the corresponding nodes is balanced (or undirected). 

The synchronized state, as given in (jlip . suggests also an interesting interpretation of the consensus 
formation mechanism of system ([7]), based on the so called condensation digraph|§ From (jlip in fact, 
one infers that all the nodes that are SC to each other (usually referred to as nodes of a strongly 
connected component (SCC)) produce the same effect on the final consensus as an equivalent single 
node that represents the consensus within that SCC. In fact, one may easily check if system in ([7]) 
locally or globally synchronizes and which nodes contribute on the consensus, simply reducing the 
original digraph to its equivalent condensation digraph and looking for the existence of a spanning 

^Please, refer to Appendix I A . 1 1 for the definition of condensation digraph and the procedure for reducing a digraph 
into its condensation digraph. 
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directed tree in the condensation digraph. The only SCCs of the original digraph that will provide 
a contribution on the final consensus are the SCCs associated to the root nodes of the condensation 
digraph. 

As an additional remark, the possibility to form clusters of consensus, rather than a global con- 
sensus, depends on the (channel) coefficients Ojj. If, e.g., these coefficients have the expression as 
given in Section \2.2\ they may be altered by changing the transmit powers Pj. As a consequence, the 
nodes with the highest transmit power will be the most influential ones. If, for example, we want to 
set a certain parameter on each node, like, e.g., a decision threshold, we can use the same consensus 
mechanism used in this paper by assigning, for example, the desired value to, let us say, node i, and 
select the transmit powers so that node i is the only node that can reach every other node. 

3.3 How to get unbiased estimates 

The closed form expression of the synchronized state, as given in (jlip (or in (|15p ). is valid for any given 
digraph (not only for undirected graphs as in [8]). Expression (llip shows a dependence of the final 
consensus on the network topology and propagation parameters, through the coefficients {aij}, {7j} 
and the delays {tjj}. This means that, even if the propagation delays do not affect the convergence 
of the proposed system, they introduce a bias on the final value, whose amount depends on both the 
delays and coefficients {aij}. The effect of propagation parameters and network topology on the final 
synchronized state is also contained in the eigenvector 7 of the Laplacian L. This implies that the 
final consensus resulting from (jlip cannot be made to coincide with the desired decision statistics as 
given by ([1]), except that in the trivial case where all the delays are equal to zero and the digraph is 
balanced (and thus strongly connected). However, expression (jlip suggests a method to get rid of any 
bias, as detailed next. 

The bias due to the propagation delays can be removed using the following two-step algorithm^] 
We let the system in ([7D to evolve twice: The first time, the system evolves according to ([7]) and we 
denote by uj*{y) the synchronized state; the second time, we set gi{yi) = 1 in ([7]), for all i, and the 
system is let to evolve again, denoting the final synchronized state by a;*(l). From (jlip . if we take the 
ratio w*(y)/w*(l), we get 



which coincides with the ideal value achievable in the absence of delays. Thus, this simple double-step 
algorithm allows us to remove the bias term depending on the delays and on the channel coefficients, 
without requiring the knowledge or estimate of neither set of parameters. 

If the network is strongly connected and balanced, 7j = l,Vi and then the compensated consen- 
sus coincides with the desired value ([T]). If the network is unbalanced, the compensated consensus 
uj* {y) / u)* {1) does not depend on the (channel) coefficients {oij}, but it is still biased, with a bias 
dependent on 7, i.e., on the network topology. Nevertheless, this residual bias can be eliminated in 
a decentralized way according to the following iterative algorithm. Let us denote by N^- the number 
of nodes in the RSCC of the digraph. At the beginning, every node sets gi{yi) = 1 and Cj = 1, 

^We focus only on the scalar system in (0, because of space limitation. 




(18) 
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i = 1, . . . ,N and the network is let to evolve. The final consensus value is denoted by uj*{1). Then, the 
network is let to evolve Nr times, according to the following protocol. At step i, with i = 1, . . . , N^, 
the nodes within the i-th SCC, set gi{yi) = 1, while all the other nodes set gkiuk) = for all k ^ i. Let 
us denote by uj*{ei) the final consensus value, where Cj is the canonical vector having all zeros, except 
the i-th component, equal to one. Repeating this procedure for all the SCO's, at the end of the Nr 
steps, each node is now able to compute the ratio a;*(ej)/a;*(l), which coincides with 7^ := J2k 7fc- 
Thus, after N^ + l steps, every node knows its own (normalized) 7,. This value is subsequently used to 
compensate for the network unbalance as follows. The compensation is achieved by simply setting, at 
each node q = Ci/j^. In fact, with this setting, the final ratio uj* (y) / uj* {!) coincides with the desired 
unbiased expression given by ([I]), for all i such that 7j 7^ 0. If the digraph ^ is SC, this procedure 
corresponds indeed to make the Laplacian matrix L(l^) balanced, so that the left eigenvector of L 
associated to the zero eigenvalue be proportional to the vector This procedure only needs some 
kind of coordination among the nodes to make them working according to a described scheduling. It 
is important to remark that, since the eigenvector 7 does not depend on the observations {i/i}, the 
proposed algorithm is required to be performed at the start-up phase of the network and repeated 
only if the network topology or the channels change through time. In summary, we can eliminate the 
effect of both delays and channel parameters on the final consensus value, thus achieving the optimal 
decision statistics as given in ([1]), with a totally decentralized algorithm, at the price of a slight increase 
of complexity and the need for some coordination among the nodes0 

3.4 Asymptotic convergence rate 

Theorem [T] generalizes previous results on the convergence speed of classical linear consensus protocols 
(see, e.g. [8l [23]) to the case in which there are propagation delays. In spite of the presence of 
delays, the proposed system still converges to the consensus with exponential convergence rate, i.e., 
||x(t;y) — a;*(y)|| < O (e*"*), where the convergence factor r < is defined in Theorem [TJ Thus, 
the convergence speed is dominated by the slowest "mode" of the system. Moreover, as expected, 
the presence of delays affects the convergence speed, as the roots of characteristic equation p{s) = 
associated to system d?]) depend, in general, on both network topology and propagation delays. 
Unfortunately, a closed form expression of this dependence is not easy to derive, and the roots of 
p{s) = need to be computed numerically. 

In the special case of negligible delays instead, we can provide a bound of the convergence factor as a 
function of the channel parameters through the eigenvalues of the Laplacian matrix L. Setting Tjj = 
in ([7|), the characteristic equation associated to ([7]) becomes p{s) = \sl + L| =0, whose solutions are 
just the eigenvalues of — L. Under the assumptions of Theorem [H it follows that 

r = - min{Re{Ai} : Aj G a{L}, and Ai ^ 0}, (19) 

i 

where cr{L} denotes the spectrum of Laplacian matrix L. The value of r is negative if and only if 
the digraph 5^ is QSC [l5l Lemma 2]. Using p^ . we can obtain bounds on the convergence rate as a 

^"in [37], the authors provided a closed form expression of the agreement obtainable with an SC balanced network 
achieving consensus on the state. However, that expression depends on the channel parameters and on the initial 
conditions in a way that the bias cannot be eliminated through a distributed procedure. 
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function of the network topology, as shown next. When the digraph is strongly connected we hav( 




r < k4 -A2 Q(D^L + L^D^)^ < 0, (20) 

where = diag(7]^, • ■ ■ , 7^), 7 is the left eigenvector of L associated to the zero eigenvalue, normal- 
ized so that II7II00 = 1) and A2(A) denotes the second smallest eigenvalue of the symmetric matrix A. 
It follows from ()20p that system ([7]) reaches a consensus with rate at least k. As special cases of (|20p 
we have the following: i) In the case of balanced digraphs, = I and thus k = — A2 (^(L + L^)); ii) 
If the digraph is undirected (and connected) = I, L = L-^ and thus k = — A2 (L), where A2 (L) is 
also known as the algebraic connectivity of the digraph [39]. For the case in which the digraph is QSC, 
some bounds of r can be found in [44 [ H6] using the generalization of the classical definition of alge- 
braic connectivity. Moreover, interestingly, the convergence rate of the system, under the assumption 
of Theorem [H can be related to the convergence rates of the SCCs of condensation digraph associated 
to the system. Because of lack of space, we suggest the interested reader to check [25] for more details. 

In conclusion, according to the above results, we infer that the convergence rate of the proposed 
consensus algorithm, at least in the absence of propagation delays, is the same as that of the classical 
linear protocols achieving consensus on the state. 

4 Numerical Results 

In this section, we illustrate first some examples of consensus, for different network topologies. Then, we 
show an application of the proposed technique to an estimation problem, in the presence of random link 
coefficients. In both examples, the analog system ([7|) is implemented in discrete time, with sampling 
step size Ts = 10~^. 

Example 1: Different forms of consensus for different topologies 

In Figure dl we consider three topologies (top row), namely: (a) a SC digraph, (b) a QSC digraph 
with three SCCs, and (c) a WC (not QSC) digraph with a spanning forest composed by two trees. For 
each digraph, we also sketch its decomposition into SCCs (each one enclosed in a circle), corresponding 
to the nodes of the associated condensation digraph (whose root SCC is denoted by RSCC). In the 
bottom row of Figure [H we plot the dynamical evolutions of the state derivatives of system ([7|) versus 
time, for the three network topologies, together with the theoretical asymptotic values predicted by 
(jlip (dashed line with arrows). As proved by Theorem (H the dynamical system in Figured^) achieves 
a global consensus, since the underlying digraph is SC. The network of Figure [Da), instead, is not 
SC, but the system is still able to globally synchronize, since there is a set of nodes, in the RSCC 
component, able to reach all the other nodes. The final consensus, in such a case, contains only 
the contributions of the nodes in the RSCC, since no other node belongs to the root of a spanning 
directed tree of the condensation digraph. Finally, the system in Figure [!}:) cannot reach a global 
consensus since there is no node that can reach all the others, but it does admit two disjoint clusters, 
corresponding to the two RSCCs, namely RSCCi and RSCC2. The middle lines of Figured);) refer to 
the nodes of the SCC component, not belonging to either RSCCi or RSCC2, that are affected by the 
^^The inequality in (|20p follows from the fact that Re{Ai} > for all nonzero eigenvalues {A^} of L 145) . 
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consensus achieved in the two RSCC components, but that cannot affect them. Observe that, in all 
the cases, the state derivatives of the (global or local) clusters converge to the values predicted by the 
closed form expression given in (fTTI) . (fT2ll or (fT4|) . depending on the network topology. 




Example 2: Distributed optimal decisions through self-synchronization. The behaviors 
shown in the previous example refer to a given realization of the topology, with given link coefficients, 
and of the observations. In this example, we report a global parameter representing the variance 
obtained in the estimate of a scalar variable. Each sensor observes a variable yi = Ai^^ + Wi, where 
Wi is additive zero mean Gaussian noise, with variance af. The goal is to estimate ^. The estimate is 
performed through the interaction system d?]), with functions gi{yi) = Vi/Ai and coefficients Cj = A^/af, 
chosen in order to achieve the globally optimal ML estimate. The network is composed of 40 nodes, 
randomly spaced over a square of size D. The size of the square occupied by the network is chosen in 
order to have a maximum delay r = lOOTg. We set the threshold on the amplitude of the minimum 
useful signal to zero, so that, at least in principle, each node hears each other node. The corresponding 
digraph is then SC. To simulate a practical scenario, the channel coefficients Ojj are generated as 
i.i.d. Rayleigh random variables, to accommodate for channel fading. Each variable aij has a variance 
depending on the distance dij between nodes i and j, equal tc0 afj = Pj/{1 + dfj). 

In Figure O we plot the estimated average state derivative (plus and minus the estimation standard 
deviation) , as a function of the iteration index. Figure [5^^) refers to the case in which there is only 

^■^We use the attenuation factor 1/(1 + dij) instead of l/d^j to avoid the undesired event that, for dij < 1 the received 
power might be greater than the transmitted power. 
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Figure 2: Estimated parameter vs. convergence time in the absence [subplot a)] and in the presence [subplot 
b)] of receiver noise: a) ML estimate by a centralized system (dotted lines); b) estimate by the proposed method, 
with no propagation delays, (dashed and dotted lines plus "x"for the average value); c) estimate by the proposed 
method, with propagation delays (dashed lines plus "<" for the average value); d) estimate by the two-step 
estimation method leading to p8|l (solid lines plus "o" for the average value). 



observation noise, but there is no noise in the exchange of information among the nodes. Conversely, 
Figure [2)3) refers to the case where the node interaction is noisy, so that the state evolution of each 
sensor is described by the state equation ij(t) = Xi(t) + Vi(t), with Xi{t) given by ([7]), where Vi{t) is 
white Gaussian noise; the SNR is ICP/c^ = 20 dB. The averages are taken across all the nodes, for 100 
independent realizations of the network, where, in each realization we generated a new topology and 
a new set of channel coefficients and noise terms. The results refer to the following cases of interest: 
a) ML estimate achieved with a centralized system, with no communication errors between nodes and 
fusion center (dotted lines); b) estimate achieved with the proposed method, with no propagation 
delays, as a benchmark term (dashed and dotted lines plus x marks for the average value); c) estimate 
achieved with the proposed method, in the presence of propagation delays (dashed lines plus triangles 
for the average value); d) estimate achieved with the two-step estimation method leading to (jlSp (solid 
lines plus circles for the average value). From Figure [2] we can see that, in the absence of delays, the 
(decentralized) iterative algorithm based on ([7]) behaves, asymptotically, as the (centralized) globally 
optimal ML estimator. In the presence of delays, we observe a clear bias (dashed lines), due to the 
large delay values, but with a final estimation variance still close to the ML estimator's. Interestingly, 
the two-step procedure leading to (jlSp provides results very close to the optimal ML estimator, with 
no apparent bias, in spite of the large delays and the random channel fading coefficients. The only 
price paid with the two-step procedure, besides time, is a slight increase of the variance due to taking 
the ratio of two noisy consensus values, as evidenced in Fig. [2]b). 



5 Concluding remarks 

In conclusion, in this paper we have proposed a totally decentralized sensor network scheme capable 
to reach globally optimal decision tests through local exchange of information among the nodes, in 
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the presence of asymmetric communication channels and inhomogeneous time-invariant propagation 
delays. The method is particularly useful for applications where the goal of the network is to take 
decisions about a common event. Differently from the average consensus protocols available in the 
literature, our system globally synchronizes for any set of (finite) propagation delays if and only if 
the underlying digraph is QSC, with a final synchronized state that is a known function of the sensor 
measurements. In general, the synchronized state depends on the propagation parameters, such as 
delays and communication channels. Nevertheless, exploiting our closed form expression for the final 
consensus values, we have shown how to recover an unbiased estimate, for any set of delays and channel 
coefficients, without the need to knowing or estimating these coefficients. This desirable result is a 
distinctive property of the consensus achievable on the state derivative and cannot be obtained using 
classical consensus/agreement algorithms that reach a consensus on the state variables. If we couple 
the nice properties mentioned above with the properties reported in [26], where we showed that, in 
the absence of delays, the consensus protocol proposed in this paper and in [26] is also robust against 
coupling noise, we have, overall, a good candidate for a distributed sensor network. 

As in many engineering problems, the advantages of our scheme come with their own prices. Three 
issues that deserve further investigations are the following: i) the states grow linearly with time; ii) the 
coefficients aij are nonnegative; and iii) a change of topology affects the convergence properties of the 
proposed scheme. The first issue has an impact on the choice of the radio interface responsible for the 
exchange of information between the nodes. To avoid the need for transmitting with a high dynamic 
range, the nodes must transmit a nonlinear, bounded function of the state value. One possibility, as 
proposed in [35] , is to associate the state value to the phase of sinusoidal carrier or to the time shift of 
a pulse oscillator. The second point requires that the receiver be able to compensate for possible sign 
inversions. As far as the switching topology is concerned, it would be useful to devise methods aimed 
at increasing the resilience of our method against topology changes. But it is worth keeping in mind 
that the previous aspects are only the reverse of the medal of a method capable to achieve a globally 
optimal decision for any set of delays and for asymmetric channels. 

6 Appendix 

A Directed Graphs 

The interaction among the sensors is properly described by a directed graph. For the reader's con- 
venience, in this section, we briefly review the notation and basic results of graph theory that will 
be used throughout this paper. For the reader interested in a more in-depth study of this fleld, we 
recommend, for example, [39| — [IT] . 

A.l Basic Definitions 

To take explicitly into account the possibility of unidirectional links among the network nodes, we 
represent the information topology among the nodes by their (weighted) directed graph. 
Directed graph. Given nodes, a (weighted) directed graph (or digraph) ^ is defined as ^ ={'^, S"}, 
where Y = {vi, . . . ,vn} is the set of nodes (or vertices) and (f C x 1^ is a set of edges (i.e., ordered 
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pairs of the nodes), with the convention that Cij = {vi,Vj) € (i.e., Vi and Vj are the head and the 
tail of the edge Cij , respectively) means that the information flows from vj to Vi. A digraph is weighted 
if a positive weight is associate to each edge, according to a proper map W : <§ ^ M+, such that if 
fiij — {vi,Vj) G ff, then iVieij) = a-j > 0, otherwise aij = 0. We focus in the following on weighted 
digraphs where the weights of loops {vi,Vi) are zero, i.e., an = for all i. If {vi, Vj) G <^{vj,Vi) G S" 
(and aij = aji, Vi 7^ j), then the graph is said to be (weighted) undirected. For any node Vi G i^, we 
define the information neighbor of Vi as 

M^^{j = l,...,N : e,, = {vi,v,) G ^} . (21) 

The set Mi represents the set of indices of the nodes sending data to node i. 
The in-degree and out-degree of node Vi G Y are, respectively, defined as: 

N N 

degin(t^») = ^(^Sontivi) =^aji. (22) 

Observe that for undirected graphs, degi^{vi) = degout(^i)- 

We may have the following class of digraphs. 
Balanced digraph The node Vi of a digraph ^ ={^, is said to be balanced if and only if its 
in-degree and out-degree coincide, i.e., deg^^{vi) = dego^t(^^)• ^ digraph ={'f,S'} is called balanced 
if and only if all its nodes are balanced, i.e., 

N N 

^aij = ^aji, yi = l,...,N. (23) 
i=i i=i 

Path/cycle A strong path (or directed chain) in a digraph ^ ={'f,S'} is a sequence of distinct nodes 
vq, . . . ,Vq G y such that {vi,Vi-i) G S", \/i = 1, . . . , q. li vq = Vg, the path is said to be closed. A weak 
path is a sequence of distinct nodes vq,. . . ,Vq G y such that either {vi-i,Vi) G <^ or (t7j,?7j_i) G (o", 
Vi = 1, . . . , g. A (strong) cyc/e (or circuit) is a closed (strong) path. 

Directed tree/forest A digraph with nodes is a (rooted) directed tree if it has — 1 edges 
and there exists a distinguished node, called the root node, which can reach all the other nodes by a 
(unique) strong path. Thus a directed tree cannot have cycles and every node, except the root, has 
one and only one incoming edge lf| A digraph is a (directed) forest if it consists of one or more directed 
trees. A subgraph %={%,S'^} of a digraph W ={r,g], with % -f and C gr\{% x %) , is a 
spanning directed tree (or a spanning directed forest), if it is a directed tree (or a directed forest) and 
it has the same node set as ^, i.e., 1^ = We say that a digraph ^ ={'^, ^} contains a spanning tree 
(or a spanning forest) if there exists a subgraph of 5^ that is a spanning directed tree (or a spanning 
directed forest). 

Connectivity. In a digraph there are many degrees of connectedness. In this paper we focus on the 
following. A digraph is Strongly Connected (SC) if, for every pair of nodes Vi and Vj, there exists a 
strong path from Vi to Vj and viceversa. A digraph is Quasi- Strongly Connected (QSC) if, for every 



Observe that some literature (e.g., [201 121) 1 defines this concept using the opposite convention for the orientation of 
the edges. 
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pair of nodes Vi and Vj, there exists a node r that can reach both Vi and vj by a strong path. A digraph 
is weakly connected (WC) if any pair of distinct nodes can be joined by a weak path. A digraph is 
disconnected if it is not weakly connected. 

According to the above definitions, it is straightforward to see that strong connectivity imphes quasi 
strong connectivity and that quasi strong connectivity impUes weak connectivity, but the converse, in 
general, does not hold. For undirected graphs instead, the above notions of connectivity are equivalent: 
An undirected graph is connected if any two distinct nodes can be joined by a path. Moreover, it is 
easy to check that the quasi strong connectivity of a digraph is equivalent to the existence of at least 
one spanning directed tree in the graph (see, e.g., [l2l p. 133 ]). 

Condensation Digraph When a digraph ^ is WC, it may still contain strongly connected subgraphs. 
A maximal subgraph of ^ which is also SC is called Strongly Connected Component (SCC) of [40]. 
Using this concept, any digraph ={Y,S'} can be partitioned into SCCs, let us say ^1=1^1, 
. . . ,'^^k—{'^k,<^k}i where Yj C y and S'j C denote the set of nodes and edges lying in the j- 
th SCC, respectively. Using this structure, one can reduce the original digraph $f to the so called 
condensation digraph '^*={y* by substituting each node set of each SCC '^i of 5^ with a 
distinct node v* S '^*of and introducing an edge in ^* from v* to v* if and only if there exists 
some edges from the i-th SCC and the j-th SCC ^ Ch. 3.2]. An SCC that is reduced to the 
root node of a directed tree of the condensation digraph is called Root SCC (RSCC). Observe that, by 
definition, the condensation digraph has no cycles [Mi Lemma 3.2.3]. Building on this property, we 
have the following. 

Lemma 1 Let'i^*={'f*,(o*} be the condensation digraph of^, composed by K nodes. Then, the nodes 
of 'S* can always be ordered as Vi, . . . G y*, so that all the existing edges in are in the form 

iv*,v^) € (f*, with l<j<i<K, (24) 

where Vi has zero in- degree. 

The ordering Vi, . . . , satisfying (j24p can be obtained by the following iterative procedure. Start- 
ing from Vi, remove and all its out-coming edges from 5^*. Since the reduced digraph with K — 1 
nodes has no (strong) cycles by construction, there must exist a node with zero in-degree in it. Let us 
denote such a node by Then, no edges in the form (^2,1^*), with j > 2, can exist in the reduced 
digraph (and thus in W*). This justifies ([MD for i = 2 and j = 1,2. The rest of dM]), for j > 2, is 
obtained by repeating the same procedure on the remaining nodes. 

The connectivity properties of a digraph are related to the structure of its condensation digraph, 
as given in the following Lemma (we omit the proof because of space limitations). 

Lemma 2 Let ^* be the condensation digraph of ^ . Then: i) 'S is SC if and only if^^is composed 
by a single node; ii) is QSC if and only if ^* contains a spanning directed tree; Hi )if'^is WC, then 
contains either a spanning directed tree or a ( weakly ) connected directed forest. 

The concept of condensation digraph is useful to understand the network synchronization behavior, 

as shown in Section [3 A similar idea was already used in [TUl EI] to study leadership problems in 

coordinated multi-agent systems. 

^''Maximal means that there is no larger SC subgraph containing the nodes of the considered component. 
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A. 2 Spectral properties of a Digraph 

We recall now some basic relationships between the connectivity properties of the digraph and the 
spectral properties of the matrices associated to the digraph, since they play a fundamental role in the 
stability analysis of the system proposed in this paper. In the following, we denote by Iat and Oat the 
A^- length column vector of all ones and zeros, respectively. 

Given a digraph ^ ={'f,S'}, we introduce the following matrices associated to Sf: 

• The N X N adjacency matrix A is composed of entries [A]^^- = aij, i,j = l,...,N, equal to the 
weight associated with the edge eij, if eij E S", or equal to zero, otherwise; 

• The degree matrix A is a diagonal matrix with diagonal entries [A]^^ = degjj^(i'i); 

• The (weighted) Laplacian L is defined as 

A I CL^k, if ? = i, 

m^, = { ^M*=i (25) 

[ -aij if j i. 

Using the adjacency matrix A and the degree matrix A, the Laplacian can be rewritten in 
compact form as L = A — A0 

By definition, the Laplacian matrix L in (j25p has the following properties: i) it is a diagonally 
dominant matrix [43]; ii) it has zero row sums; and iii) it has nonnegative diagonal elements. From 
i)-iii), invoking Gersgorin's disk Theorem [23], we have that zero is an eigenvalue of L corresponding 
to a right eigenvector in the Null{L} ^ spanjlAr}, i.e., 

L 1^ = 0^, (26) 

while all the other eigenvalues have positive real part. This also means that rank(L) < — 1. 

Moreover, from (|23p and (|26p . it turns out that balanced digraphs can be equivalently characterized 
in terms of the Laplacian matrix L: A digraph is balanced if and only if l^v is also a left eigenvector 
of L associated with the zero eigenvalue, i.e., 

l^L = 0?;, (27) 

or equivalently ^(L + L^) is positive semidefinite. 

The relationship between the connectivity properties of a digraph and the spectral properties of its 
Laplacian matrix are given by the following. 

Lemma 3 Let 'S ={'^, (^} he a digraph with Laplacian matrix L. The multiplicity of the zero eigenvalue 
of Ij is equal to the minimum number of directed trees contained in a spanning directed forest of^. 

Corollary 2 The zero eigenvalue ofL is simple if and only ifS contains a spanning directed tree (or, 
equivalently, it is QSC). 



Observe that the definition of Laplacian matrix as given in (|25p coincides with that used in the classical graph theory 
literature, except for the convention we adopted in the orientation of the edges. This leads to L expressed in terms of 
the in-degrees matrix, rather than of the out-degrees matrix. Our choice is motivated by the physical interpretation we 
gave to the edges' weights, as detailed in Section [2.21 
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Lemma [3] comes directly form Theorem 9 and Theorem 10 of |44] . Corollary [2] was independently 
proved in many papers, such as [9l Corollary 2], [20, Lemma 2]. Observe that, since the strong 
connectivity of the digraph implies QSC, the results provided in [U [21] for SC digraphs, can be 
obtained as special case of Corollary [2l Specifically, we have the following. 

Corollary 3 Let ^ ={'f,S'} be a digraph with Laplacian matrix L. //^ is SC, then L has a simple 
zero eigenvalue and a positive left- eigenvector associated to the zero eigenvalue. 

According to Corollary [21 because of ([26]) . the Laplacian of a QSC digraph has an isolated eigenvalue 
equal to zero, corresponding to a right eigenvector in the spanjlAr}. Observe that, for undirected 
graphs. Corollary [3] can be stated as follows: rank(L) = — 1 if and only if ^ is connected |39J. For 
directed graphs, instead, the "only if part does not hold. 

We derive now the structure of the left-eigenvector 7 of the Laplacian matrix L associated to the 
zero eigenvalue, as a function of the network topology. This result is instrumental to prove the main 
theorem of this paper. We have the following. 

Lemma 4 Let ^ ={'f, be a digraph with N nodes and Laplacian matrix L. Assume that ^ is QSC 
with K SCC's . . .,^K={yK,'^K}, with 1^ C r, (f, C ^, \r,\ = n and Y^^r^ = N, num- 

bered w.l.o.g. so that ^\ coincides with the RSCC of . Then, the left- eigenvector 7 = [7-,^, . . . ,77v] 
of L associated to the zero eigenvalue has the following structure 

li = { (28) 
1=0, otherwise. 

Lf^i is also balanced, then 7^,^ = [7-^, . . . ,7j.J"^ G spanjln}, where ri= ["/ij . 

Proof. Since the digraph is QSC, the set contains either all or < ri < nodes of 5^. 

In the former case, Yi = Y and thus the digraph is SC, by definition. Hence, according to Corollary 
[3|(cf. Appendix [X2]), the Lapl acian matrix L(^) has a simple zero eigenvalue with left-eigenvector 
7 > 0. If, in addition, is balanced (and thus also SC), then 7 G spanjlTv} • 

We consider now the latter case, i.e., < ri < A^. According to Lemma[2l the condensation digraph 
<^*={y*^ *} of contains a spanning directed tree composed by K nodes v^, . . . , G Y* (associated 
to the K SCCs '^i, ... with root node (associated to the SCC ^1 of '^). From Lemma [H we 

assume, w.l.o.g., that the nodes v^, . . . G i^* are ordered according to ([M]) . so that the Laplacian 
matrix of $f * be a lower triangular matrix. 

Using the relationship between the original digraph and its condensation digraph (cf. Ap- 
pendix [XTT]) , the Laplacian matrix L(^) of can be written as a lower block triangular matrix, i.e. 

/ Li ... \ 

* B2 '■• : 

* * • . 
\ * ... * Bk J 

with Li = L(^#l) G M^i^''i denoting the ri x ri Laplacian matrix associated to the root SCC '^i, and 
Bfc = Lfc + Dfc G M'''=^^'=, where = L(^fc) is the x Laplacian matrix of the k-th SCC 



(29) 
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of ^ and Djt is a nonnegative diagonal matrix whose i-th entry is equal to the sum of the weights 
associated to the edges outgoing from the nodes in ^^i, . . . , ^^yt-i and incoming in the i-th node in ^t-P^ 
Observe that, since is QSC, each has at least one positive entry; otherwise the SCO would 
be decoupled from the other SCCs. 

Since each is SC by definition, we have rank(Lfc) = — 1 (cf. Corollary [3]) and Null(Lfc) = 
span(lrj.) (see ([26|) ). Using these properties and the fact that D^l^j. 7^ 0^^., we have that the rows 
(columns) of + are linearly independent, or equivalently, 

rank(Bfc) = r^, VA: = 2, . . . , K. (30) 

Using (j29p and (jSOp . we derive now the structure of the left-eigenvector 'y of L(^) in ()29p . Parti- 
tioning 7 = [7^j,7^2' • • • ••'^'rj^ according to (f29l) . with 7,,^ G ^ we have 

7^^Li = 0^^ and 7^,Bfc = 0^^, /c = 2,...,K, (31) 

which, using Corollary [3] and ([30]) . provides 

7^^ > Ori and 7^^, = 0^, , /c = 2, . . . , i^, (32) 

respectively. If, in addition, ^1 is balanced, then 7^^ € spanjl^.-^} . ■ 

B Preliminary Results on Linear Functional Differential Equations 

In this section, we introduce some basic definitions on linear functional differential equations and prove 
some intermediate results that will be extensively used in the proof of Theorem [H as given in Appendix 

o 

To formally introduce the concept of functional differential equations, we need the following nota- 
tions: let be an A^-dimensional linear vector space over the real numbers with vector norm ||-|| ; de- 
noting by r the maximum time delay of the system, let C = C ([— t, 0], M^) [or = 0], M^)] 
be the Banach space of continuous (or continuously differentiable) functions mapping the interval 
[— r, 0] to with the topology of uniform convergence, i.e., for any (/> € C (or (/) G C^) the norm of 
^ is defined as \^\^ = sup_,-<^<o ||0('!9)||. 

Since in this paper we considered only linear coupling among the differential equations as given in 
([Tj), we focus on Linear homogeneous Delayed Differential Equations (LDDEs) with a finite number 
of heterogeneous non-commensurate delava^l [38]-[5T]: 

Xi{t) = fcj ^ aij {xj{t - Tij) - Xi{t)) , i = l,... ,N, t> to, (33) 



where ki is any arbitrary positive constant. Equation (j33p means that the derivative of the state 
variables x at time t depends on t and x('!9), for € [t — T,t]. Hence, to uniquely specify the evolution 



^^We assume, w.l.o.g., that in each SCC (^k the nodes are numbered from 1 to r^;. 

^^Observe that the LDDE (|33[) falls in the more general class of LDDE's studied in [48)-[50|. In fact, it is straightforward 
to check that LDDE ((33| can be rewritten in the canonical form of gHJ Ch. 6, Eq. (6.3.2)], [SO] Ch. 3, Eq. (3.1)]. 
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of the state beyond time to, it is required to specify the initial state variables x(t) in a time interval of 
length T, from to — "J" to to, i.e., 

Xi{to + ^) = (l)M, ^£[-T, 0], i = l,...,N. (34) 

Here after, the initial conditions cf) = {</>j}j are assumed to be taken in the set of (continuously 
differentiable) functions that are bounded in the nornl^ |0|^ = sup_^<^<o ||0(t^)|loo' i-^-' 

101,= sup 110(^)11^ </3<+oo I. (35) 

-r<i?<0 I 

Given cf) £ Cj^ and to, let x[to,</>](i) denote the function x at time t, with initial value cj) at time 
to, i.e., x(to + "!?) = (pi'd), with € [— r, 0]. For the sake of notation, we define x.[(pi\{t) = x.[0,4>]{t)- 
A function x[to, 0](i) is said to be a solution to equation ()33p on [to — to + ti) with initial value </> 
at time to, if there exist ti > such that: i) x € C ([to — r, to + ti], M") ; ii) x[to, <P]it) satisfies (p3]) . 
Vt G [to, to + ti]; and iii) x(to + ??) = (/)(i9), with t9 G [-r, 0]. It follows from [53 Theorem 1.2] that 
such a solution to equation (pSl) exists and is unique. We focus now on two concepts related to the 
trajectories of (j33p . namely boundedness and stability. 

Definition 2 Given system i33\) . a solution x[to, </>](t) is bounded if there exists a (3 = (3{to, </>) such 
that ||x[to, </>](t)|| < (3{to,4>) for t > to — t. The solutions are uniformly bounded if, for any a > 0, 
there exists a jS = (3{a) > such that for all to G M, </> G C and \(f)\^ < a we have ||x[to, 0](i)|| < /3 for 
all t > to- 

As far as the stability notion is concerned, it is not at all different from its counterpart for unretarded 
systems, except for the different assumptions on the initial conditions. The interested reader may refer, 
e.g., to |50[ I51j for an in-depth treatment of this topic. We focus here on effective methods on proving 
the stability of LDDEs. Since system (|33p is linear, the stability analysis can be carried out either 
in the time-domain [50\ [5T] or in the frequency-domain |48^ H9] , as for classical ordinary differential 
equations (see, e.g., [52] )■ In this paper we focus on the latter approach. The same conclusions can 
also be obtained using the time-domain analysis, based on Lyapunov-Krasovskii functional [53]. Before 
stating the major result, we need first the following intermediate definitions. 

Let C+ = {s G C : Re{s} > 0}, C_ = {s G C : Re{s} < 0}, and C+ be the closure of C+, i.e,,^ 
C+ = {s G C : Re{s} > 0}. Denoting by T^"^™- the set of n x m matrices whose entries are analytico 
and bounded functions in C+, let us introduce the N x N diagonal degree matrix A > OnxN and the 
complex matrix H(s) G C^^^ , defined respectively as 



A = diag{kidegi^{vi),...,kNdegi^{vN)) and [H(s)]j^- = <( 7 /' (36) 



0, if i = j, 

ruju/jjo - , if i ^ j, 

with degjn(t>i) given in ([22]) (see Appendix [X]) . Observe that H(s) G Ti^^^ . We can now provide the 
main result of this section, stated in the following lemma. 



^*We used, without loss of generality, as vector norm ||-|| in the infinity norm \\-\\^ , defined as ||x||^ = maxi \xi\. 

^^A complex function is said to be analytic (or holomorphic) on a region C C if it is complex differentiable at every 
point in T>, i.e., for any £ T) the function satisfies the Cauchy-Riemann equations and has continuous first partial 
derivatives in the neighborhood of zq (see, e.g., [5¥| Theorem 11.2]). 
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Lemma 5 Given system i33\). assume that the following conditions are satisfied: 

bl. The initial value functions 4> € C^, and the solutions x[(^](t) are hounded; 
b2. The characteristic equation associated to Ii33\) 



p{s) = det (si + A - H(s)) = 0, (37) 

with A and H(s) defined in i36\) . has all roots {sr}r £ with at most one simple root at 
s = oB 

Then, system I133\) is marginally stable, i.e., \/cf) G and Re{si} < c < 0, there exist ti and a, 
with to < ti < +00 and < a < +oo, independent of cf), and a vector x°°, with ||x°°|| < +oo, such that 

||x[</)](t) -x°°|| < a|(/)|,e"*, yt>ti. (38) 

Proof. Under assumption bl), according to [48 1 Theorem 6.5], the stabihty properties of system (13311 
are fuhy determined by the roots of the characteristic equation associated to ()33p . as detailed nexto 
Denoting hy A4 = {Re{sr} : pisr) = 0} the set of real parts of the characteristic roots {sr}r, it follows 
from \A8\ Theorem 6.7] that, V<^ G and c ^ M, there exist ti and a, with < ti < +oo and 
< a < +CXD independent on (jj, such that 



x[(/)](t) - lim V Pr(t)e'''-* 

Sr(=C,: Re{Sr}>C 



<a|0|,e'=*, Vt>ti, (39) 



where each p,, is a (vectorial) polynomial of degree less than the multiplicity of Sr, Ci denotes a contour 
in the complex plane of radius increasing with I and centered around s = (see [481 Sec. 4.1] for more 
details on how such contours Ci need to be chosen), and the sum in (j39p is taken over all characteristic 
roots Srwithin the contour Ci and to the right of the line Re{sr} = c. Observe that, since the number 
of such roots is finiteS [501 Theorem 1.5] (see also 081 Ch. 6.8 and Ch. 12]), the hmit in ([39]) is always 
well-defined. 

Using (j39p . we prove now that, under b2), system (j33p is marginally stable. Invoking the property 
that, for any given 7 G M, the number of characteristic roots Sr with Re{sr} > 7 is finite, one can 
always choose the constant c ^ in ([5^ so that Re{si} < c < Re{so} = 0, which leads to 

||x[0](t) -Poll < a 101,6'=*, yt>ti, (40) 

where we have explicitly used the assumption that the (possible) root sq = is simple and that there 
are no roots with positive real part. It follows from [48, Corollary 6.2] that this condition is also 
sufficient to guarantee that po is bounded, i.e., ||po|| < +00. Setting in (P0|) x°° = po, we obtain (pHj) . 
which completes the proof. ■ 



^°We assume, w.l.o.g., that the roots {sr} are arranged in nonincreasing order with respect to the real part, i.e., 

= Re{so} > Re{si} > Re{s2} > .... 

Observe that assumption bl) is only sufficient for the existence of the Laplace transform of the solutions to ((33}. 
■^■^The reader who is familiar with Linear Retarded Functional Differential Equations (LRFDE) may observe that this 
result comes directly from the fact that the characteristic equation (|37p does not have neutral roots ,48^ Ch. 12]. 
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Remark 1. It follows from Lemma [5] that, under assumptions bl)-b2), all the solutions to (j33p 
asymptotically convergence to the constant vector x°°. Moreover, equation (j38p provides an estimate 
of the convergence rate of the system: as for systems without delays, the convergence speed of (ISSp is 
exponential^^ with rate arbitrarily close to Re{si} that, in general, depends on network topology and 
delays. 

Remark 2. Lemma [5] generalizes results of [Ml [50], where the authors provided alternative conditions 
for the asymptotic stability of a Linear Retarded Differential Equation (LRDE). Interestingly, Lemma 
[5] contains some of the conditions of [iHl [50] as a special case: System ([33]) is asymptotically stable if 
all the characteristic roots of (|37p have negative real part [see ([38])]. This conclusion is the same as 
that for linear unretarded systems (see, e.g., [52]). 

Moreover, Lemma [5] can be easily generalized to include the cases in which one is interested in 
"oscillatory" behaviors of system (|33p . It is straightforward to see that bounded oscillations arise if 
assumption b2) is replaced by the following condition: All the roots of characteristic equation (|37p 
have negative real part and the roots with zero real part are simple. 



In the following, for the sake of notation simplicity, we drop the dependence of the state function from 
the observation, as this dependence does not play any role in our proof. 

C.l SufRciency 

We prove that, under al)-a3), the quasi strong connectivity of digraph ^ associated to the network in 
([7]) is a sufficient condition for the system ([7]) to synchronize and that the synchronized state is given 
by (|lip . To this end, we organize the proof according to the following two steps. 

We first show that, under al)-a2) and the quasi-strong connectivity of ^, the set of LRFDEs ([7]) 
admits a solution in the form 



if a = w*, where is defined in (jlip and {2;*q} are constants that depend in general on the system 
parameters and the initial conditions. This guarantees the existence of the desired synchronized state 
(cf. Definition [T]) . Then, invoking results of Appendix[Bl, we prove that, under al)-a3) and the quasi- 
strong connectivity of ^, such a synchronized states is also globally asymptotically stable (according 
to Definition [1]) . 

C.1.1 Existence of a synchronized state 

Let us assume that conditions al)-a2) are satisfied and that ={y,S'} associated to ([7]) is QSC. 
The synchronized state in the form (|4ip is a solution to ([7]) if and only if it satisfies equations ([7]) 
(cf. Appendix [BJ, i.e., if and only if there exist a and {x^q} such that the following system of linear 
equations is feasible: 



C Proof of Theorem [T] 



x*{t) = at + xlo, 



i = l,...,N. 



(41) 





(42) 



We say that x(t) converges exponentially toward x°° with rate r < if ||x(t) — x°°|| < O (e""*) . 
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where 

Auji{a) = giivi) - a i 1 + — ^ Oij Tij j . (43) 
V j&M J 

Introducing the weighted Laplacian L = L(?^) associated to digraph ^ (cf. Section [A]), the system in 
(j42|) can be equivalently rewritten as 

Lx5 = lDeAcj(a), (44) 

where Xq = [x^ q, . . . , q]-^, Dc = diag(ci, . . . , cat), and Aa;(a) = [Au}i{a), . . . , AujN{a)]'^ , with 
ALOi{a) defined in ([13]) . Observe that, under al)-a2) and the quasi-strong connectivity of 1^, the 
graph Laplacian L has the following properties (cf. Corollary [2]) : 

rank(L) = iV - 1, J\f{L) = span {Iat} , and Af(L^) = span {7} , (45) 

where A/'(L) denotes the (right) null-space of L, and 7 is a left eigenvector of L corresponding to the 
(simple) zero eigenvalue of L, i.e., 7"^L = 0"^. 

Assume now that a is fixed. It follows from (j45p that, for any given a, system (j44p admits a 
solution if and only if DcAa;(a) € span{L} . Because of (jl5|) . we have 

DcA cj(a) G span {L} ^ 7^DcA u{a) = 0. (46) 

It is easy to check that the value of a that satisfies ()46p is a = to*, with to* defined in (jlip . Hence, if 

a = uj*, the synchronized state in the form ([^T|) is a solution to ([7]), for any given set of {rij}, {gi}, 

{cj}, {aij} and K 0. The structure of the left eigenvector 7 associated to the zero eigenvalue of L 

as given in (jlip comes directly from Lemma [H 

Setting a = u!*, system admits 00^ solutions, given by 

x^ = ^L^DcA u{uj*) + span{l7v} = xq + span{l^}, (47) 
K 

where 

xo^^L«DeAcj(u;*), (48) 
Auji{uj*) is obtained by (|43p setting a = uj* and is the generalized inverse of the Laplacian L [56j. 

C.1.2 Global Asymptotic Stability of the Synchronized State 

To prove the global asymptotic stability of the synchronized state of system ([7]), whose existence has 
been proved in Appendix IC.l.ll we use the following intermediate result (see Appendix [B] for the 
definitions used in the lemma). 



I^NxN 

Then, p[H{s)) is a subharmonia^ bounded (above) function on 



Lemma 6 ( [551 Theorem 2.2 )Let H(s) G n^xN ^^^^ p(H(s)) denote the spectral radius o/H(s) 



*See, e.g., [541 Ch. 12], [5S], for the definition of subharmonic function. 
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We first rewrite system ([7]) in a more convenient form, as detailed next. Consider tfie following 
change of variables 

^^{t)^Xi{t)-{uj*t + Xi^o), i = l,...,N, (49) 

where and {xifi} are defined in ([TT]) and (08]), respectively, so that the original system d?]) can be 
equivalently rewritten in terms of the new variables {^i{t)}i as 

'^SeM i = l,...,N, t>0, (50) 

^i(i9) = </),(i9) ^ 0i(i?) - cu'^i? - Xi,o, ^ e [-r, 0], 

where are the initial value functions of the original system ([7|). Using (j48p [see also ()42p . with 
a = a;*] , system ([50]) becomes 

4^i{t) = k,Y^ aij (^'j(t - Tij) - ^^{t)) , 

jeM z = l,...,iV, i>0, (51) 

M',(^?) =0,(^?), i9g[-t, 0], 

where, for the sake of convenience, we defined ki = K/ci > 0, for i = 1, . . . ,N. 

It follows from (j5ip that the synchronized state of system ([7]), as given in (j4ip . is globally asymp- 
totically stable (according to Definition [1]) if system in (jSip is marginally stable (cf. Appendix [B]). To 
prove the marginal stability of system (|5ip . it is sufficient to show that system (|5ip satisfies Lemma 
[5j To this end, we organize the rest of the proof in the following two steps. 

Step 1. We show that, under al), a3), all the solutions ^[(f)]{t) to system (j5ip are uniformly bounded, 
as required by assumption bl) of Lemma [Sj 

Step 2. We prove that, under al)-a3) and the quasi-strong connectivity of the digraph, the charac- 
teristic equation associated to system has all the roots in C_ and a simple root in s = 0, 
which satisfies assumption b2) of Lemma [5l 

Step 1. Given any arbitrary P < +oo, assumptions al), a3) are sufficient to guarantee that all the 
trajectories of (j5ip are uniformly bounded (cf. Definition [2|) , as shown next. Since cj) € C^, we have 
(see dSS])) 

\^^il9)\<(3, Vi = l,...,iV, ^G[-T, 0]. (52) 

Condition (|52p is sufficient for {^'j[(/)](f)} to be uniformly bounded for all t > 0. In fact, assume that 
{^'j[0](t)} are not bounded. Then, according to Definition [2] of Appendix [Bl there must exist some 
t> and a set J' C {1, . . . , AT} such tha10 

||*(t)IL<A Vt<t, (53) 



and 

I*, (I) I =,3 and = = jej. 

\ > 0, if *i (t) = /J, 

This would imply that, for some t > t, (t) I > (3. 

Given ([53P - ([MP and j G J, we have two possibilities, namely: 



(54) 



^For the sake of notation, we omit in the foUowing the dependence of ^[cj)](t) on (p. 
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i) ^fj. (t) = p and ^'j(t) > 0. Since - Tji)\ < (3, = 1, . . . , N (see §3^)) and A;^ > 0, from §^ 

we have 

= ('^^^^ - ^i^) - *i (^)) = 2^ (^*(^ - "^i^) - /5) < 0' (55) 

where in the last inequahty, we used ([52]) . Expression ([55]) contradicts the assumption ^'j(t) > 0. 

ii) "i/j (t) = —[i and ^j(t) < 0. However, since 

^j© = % (^^(^ " ^J'^) " = J2 "J'^ (^^(^ " ^i*) + /?) > 0, (56) 

a contradiction with ^'j(t) < results. 

Thus, any solution ^[(f)](t) to ([5T|) is bounded and remains in for any t > 0. 
Step 2. We study now the characteristic equation ([37|) associated to system ([5T]) . assuming that 
al)-a3) are satisfied and that digraph 1^ is QSC. First of all, observe that, since A — H(0) = KJDcL, 
we have 

p{0) = det (A - H(0)) = K det (D^) det (L) = 0, (57) 

where A and H(s) are defined in (f36|l . and the last equality in (f57|l is due to the properties of 
Laplacian matrix L = L(^^) of digraph ^ (see ([26j) ). It follows from (|57j) that p{s) has a root in s = 0, 
corresponding to the zero eigenvalue of the Laplacian L (recall that det (Dc) 7^ 0). Since the digraph 
is assumed to be QSC, according to Corollary [21 such a root is simple. 

Thus, to complete the proof, we need to show that p{s) does not have any solution in C+\{0}, i.e., 

det (sI+A - H(s)) / 0, Vs G C+\{0}. (58) 

Since sI+A is nonsingular in C-|-\{0} [recall that, under al), A > 0, with at least one positive 
diagonal entry], ([58]) is equivalent to 

det (l-(sI+A)~^H(s)) / 0, Vs G C+\{0}, (59) 

which leads to the following sufficient condition for ([58]): 

pis) = p ((sI+A)-i H(s)) < 1, Vs G C+\{0}. (60) 

Since (sI+A)"^ G H^x^ and H(s) G n^^^ (cf. Appendix [B]) , it follows from Lemma [6] that the 
spectral radius p (s) in (|60p is a subharmonic function on C+. As a direct consequence, we have, among 
all, that p (s) is a continuous bounded function on cj^fl and satisfies the maximum modulus principle 
(see, e.g., [5?i Ch. 12]): p{s) achieves its global maximum only on the boundary of C+0 Since p{s) 
is strictly proper in C+, i.e., p (s) — > as \s\ +oo while keeping s G C+, it follows that 

sup p{s) < sup p{s) < sup p {jij) . (61) 
Observe that p (s) is well-defined in s = 0, and p (0) = 1. 

According to the ma:x;imum modulus theorem [541 Theorem 10.24], the only possibility for p{s) to reach its global 
maximum also on the interior of C+ is that p(s) be constant over all C+, which is not the case. 
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Using (|6T|) . we infer that condition (j60j) is satisfied if 

For any matrix norm ||- 
row sum matrix norm I 



(62) 

since p{A) < i|A|| VA G C^^*-^ [43, Theorem 5.6.9], using the maximum 
^ defined as [431 Definition 5.6.5] 

M 



max > [ Al 
=l,...,iV^I^ 
9=1 



rq 



(63) 



we have 



(jc^I+A)-i H(ic^) 



max 
T-=l,...,Af 



max 

r=l,...,N 



< 1, 



E 



/c^. O- 



rq 



juj + kr degin(fr.) 



(64) 



juJ + kr degin(l'r 

where in the last inequahty the equahty is reached if and only if w = 0. It follows from (j64p that 
p (juj) < 1 for all Lo ^ 0, which guarantees that condition ()60p is satisfied. 

This proves that assumption b2) of Lemma [5] holds true. Hence, all the trajectories 'J'[0](t) — > 
as t +00, with exponential rate arbitrarily close to r = {minj Re{si} : p{si) = and Sj 7^ 0}, where 
p{s) is defined in (j37p and satisfies the linear system of equations L^'"" = 0, whose solution is 
€ spanlljv} (Corollary [2|). In other words, system ()5ip exponentially reaches the consensus on 
the state. ■ 

C.2 Necessity 

We prove the necessity of the condition by showing that, if the digraph of d?]) is not QSC, different 
clusters of nodes synchronize on different values. This local synchronization is in contrast with the 
definition of (global) synchronization, as given in Definition [TJ Hence, if the overall system has to 
synchronize, the digraph associated to the system must be QSC. 

Assume that the digraph associated to ([7]) is not QSC, but WC with K SCCs and, let us say, 
r < K (distinct) RSCCs . Then, according to Lemma [21 the condensation digraph '^^*={'V* ,S'*} 
contains a spanning directed forest with r distinct roots (associated to the r RSCCs of the K SCCs of 
^^). Ordering the nodes v^, . . . , S according to Lemma[Tl and exploring the relationship between 
5f*and ^ (cf. Appendix IC.l.ip . one can write the Laplacian matrix L = L(^) as an r-reducible matrix 
i.e., 
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where the first r diagonal blocks are the Laplacian matrices of the r RSCCs of ^, and the {B^}, with 
k > r, are the nonsingular matrices associated to the remaining SCCs. Each of these matrices can be 



29 



written as the linear combination of the Laplacian matrix of the corresponding SCC and a nonnegative 
diagonal matrix with at least one positive diagonal entry. The structure of L given in (j65p shows that 
the RSCCs associated to the first r diagonal blocks are totally decoupled from each other. Hence, (at 
least) the state derivatives of the nodes in each of these r RSCCs reach a common value (since the 
corresponding subdigraphs are SC by construction) that, in general, is different for any of the SCCs. 
This is sufficient for the overall system not to reach a global synchronization. 
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